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Abstract 

Main Objective:\Ne examine the extent of taxonomic and biogeographical uncertainty in a well-studied group of Australian 
Lepidoptera, the hawkmoths (Sphingidae). 

Methods: We analysed the diversity of Australian sphingids through the comparative analysis of their DNA barcodes, 
supplemented by morphological re-examinations and sequence information from a nuclear marker in selected cases. The 
results from the analysis of Australian sphingids were placed in a broader context by including conspecifics and closely 
related taxa from outside Australia to test taxonomic boundaries. 

Results: Our results led to the discovery of six new species in Australia, one case of erroneously synonymized species, and 
three cases of synonymy. As a result, we establish the occurrence of 75 species of hawkmoths on the continent. The analysis 
of records from outside Australia also challenges the validity of current taxonomic boundaries in as many as 18 species, 
including Agrius convolvuli (Linnaeus, 1758), a common species that has gained adoption as a model system. Our work has 
revealed a higher level of endemism than previously recognized. Most (90%) Australian sphingids are endemic to the 
continent (45%) or to Australia, the Pacific Islands and the Papuan and Wallacean regions (45%). Only seven species (10%) 
have ranges that extend beyond this major biogeographical boundary toward SE Asia and other regions of the Old World. 

Main Conc/usions:Th'\s study has established that overlooked cryptic diversity and inaccurate species delineation produced 
significant misconceptions concerning diversity and distribution patterns in a group of insects that is considered well 
known taxonomically. Because DNA barcoding represents a straightforward way to test taxonomic boundaries, its 
implementation can improve the accuracy of primary diversity data in biogeography and conservation studies. 
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Introduction 

"The crux of the problem is that if we don't know what is out there or 
how widely species are distributed, how can we convince people about 
the reality and form of the biodiversity crisis?" Riddle et al. [1]. 

As in many other groups of Australian organisms, the insect 
fauna of this continent is renowned for its diversity, uniqueness, 
and many iconic endemics [2] . However, in contrast to plants and 
vertebrates, the diversity of Australian insects remains poorly 
documented. There are about 62,000 described species of insects 
on the continent, representing from 15% to 50% of the total fauna 
depending on estimates [3-5] . The level of species endemism is 



difficult to evaluate and undoubtedly varies among insect orders 
[2], but is generally considered to be very high. The overall value 
may reach 90% [3], and many families include only endemics [2]. 
However, much effort is still needed to fill gaps in taxonomy and 
to extend understanding of taxon distributions, deficits that have 
come to be termed respectively the Linnean and Wallacean 
shortfalls [6]. 

The integration of molecular methods into taxonomic practice 
has improved our capacity to understand and describe diversity 
[7,8] and has been proposed as a way to help increase the pace of 
species discovery and description [9,10]. Although the application 
of integrative approaches to the Australian fauna is still in its 
infancy, studies have already revealed spectacular levels of 
overlooked and cryptic species in some groups of vertebrates 
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[11,12]. Similar cases have also been reported in invertebrates, 
typically revealing species that raise the already high level of 
endemism in Australia [13-15]. Particularly comprehensive efforts 
have been directed toward Australian Lepidoptera. Much of this 
information is available in an indexed public reference library of 
DNA barcode sequences [16,17] (database accessible in BOLD at 
www.boldsystems.org) and via the current taxonomic system for 
the complete Australian fauna [18] (see also http://www. 
lepbarcoding.org/ australia). 

The order Lepidoptera includes about 130 families whose 
diversity varies dramatically — from a single to more than 24,000 
species [19]. Because the current count of approximately 160,000 
described species is thought to reflect less than one-third of the 
total [20], diversity estimates for some families, especially smaller- 
bodied taxa, are very uncertain. However, other families, such as 
the Sphingidae, have attracted so much taxonomic interest that 
present estimates of diversity are thought to be comprehensive 
except for 'frontier' regions. The last global conspectus on the 
Sphingidae recognized 1278 species [21], but this count has now 
risen by about 200 species [19] with most new taxa deriving from 
hyperdiverse tropical faunas. In Australia, Europe and North 
America, the pace of species discovery has slowed, suggesting that 
species counts are complete or nearly so. 

The Sphingidae of Australia 

Australia provides a good example of a continental fauna of 
sphingids with a mature taxonomy. Collection programs have 
been wide ranging, morphological studies have been careful, and 
species discovery has slowed: 53 species were described by the end 
of the 19 th century, 62 by 1927, but only two more taxa were 
added during the next 70 years [22]. Although many families of 
Australian Lepidoptera are dominated by endemic species [2] , the 
sphingid fauna is thought to include a mix of endemics and species 
with distributions extending into the Pacific islands, Southeast Asia 
and even into Europe and Africa. 

The descriptions of some wide-ranging species in the Australian 
fauna date back to the launch of Linnaean nomenclature (e.g. 
Agnus convolvuli (Linnaeus, 1758) and Hippotion celerio (Linnaeus, 
1 758)). Other species were described only slightly later from Asia 
(e.g. Theretra nessus (Drury, 1773) and Mephele hespera (Fabricius, 
1775), both described from India). The first sphingids to be 
described from Australia were the endemics Coequosa australasiae 
(Donovan, 1805), Coequosa triangularis (Donovan, 1805) and Cizara 
ardeniae (Lewin, 1805). Moulds [22] provided an historical 
overview of work on Australian sphingids, listing 64 species in 
22 genera for the continent. Although only two species [Imber 
tropicus (Moulds, 1983) and Psilogramma argos Moulds & Lane, 1999) 
were described from 1930-2000, eleven species and one subspecies 
have been described since then, including seven species of 
Psilogramma, a complex genus whose species count exploded 
globally from four to more than 60 over the past decade [21,23- 
25], although perhaps only about half of these are actually valid 
species (I.J. Kitching, unpublished data). Three new genera, Imber 
Moulds et al, 2010, Pseudoangonyx Eitschberger, 2011, and 
Cerberonoton Zolotuhin & Ryabov, 2012 were recently erected for 
rather distinctive species, with a fourth new genus being 
established for a newly encountered species (£acria vojtechi Haxaire 
& Melichar, 2003) demonstrating that striking discoveries can still 
occur in this fauna. Recent nomenclatural changes within the 
genus Macroglossum also affected the names of two Australian 
species [26]. In his checklist of Australian Sphingidae, Moulds [22] 
omitted subspecies names, concealing the distinctiveness of the 
Australian fauna from those of Neighbouring Southeast Asia and 
Pacifica. In actuality, many Australian sphingids have been 



assigned to an endemic subspecies because of their morphological 
divergence from the nominotypical subspecies. At least some of 
these cases may reflect cases of long separation that merit 
recognition as distinct species, but their status has not yet been 
investigated in detail. 

The delineation of species/subspecies and the existence of 
cryptic or overlooked undescribed species can be a serious 
impediment for biogeography and conservation studies [6]. This 
"Linnean shortfall" represents a formidable challenge for research 
in these disciplines [27]. Building on recent access to molecular 
tools for species identification and discrimination [28,29] and the 
success of integrative approaches in taxonomy [30-32], the 
present study employs sequence diversity in the DNA barcode 
region of the mitochondrial gene GOI to investigate the diversity 
of Australian sphingids and their distributions. We begin by 
examining patterns of sequence diversity within the Australian 
fauna to ascertain the efficacy of DNA barcodes to discriminate 
known species and reveal overlooked diversity. We then place 
these results in a broader geographic context by examining 
patterns of sequence divergence between Australian populations 
and their conspecific lineages outside the continent. 

Material and Methods 

Specimen Sampling 

More than 1200 Australian sphingids were sampled, most from 
three collections - the Biodiversity Institute of Ontario (361), the 
Zoologische Staatssammlung Mtinchen (300), and the Australian 
National Insect Collection (112). The remaining specimens derive 
from 17 other collections (see details in Table SI). All institutions 
and collections granted us permission to access and study the 
material used in this work. Sampling aimed to maximize the 
geographic coverage for each species or subspecies to examine the 
extent of barcode variation across its distribution. In the most 
common species, a large number of specimens were sampled 
(more than 30 for 1 1 species, 113 for Agnus convolvuli), but fewer 
than five specimens were examined for 20 species/subspecies that 
are rare. Each record was given a unique specimen identifier 
(SamplelD) and sequence identifier (ProcessID) to provide a direct 
link between voucher specimens and DNA barcode records. 
Collection data, a photograph and ancillary information, such as 
the collection holding each specimen, were compiled in BOLD for 
each individual record. 

DNA Sequencing 

The methods used for generation of DNA sequences follow 
standard protocols designed for amplification and sequencing of 
the standard DNA barcode. In four species, the same DNA 
extracts were also used for the sequencing of the D2 expansion 
segment of the 28S rDNA gene. Details of these methods are given 
as Supporting Information (Appendix SI). 

Data Analysis 

A nearest Neighbour analysis was conducted with BOLD using 
either a dataset restricted to sphingid records from Australia or the 
full database for world sphingids, including all COI barcode 
sequences longer than 500 bp. The latter database includes about 
23 K sequences, 1.8 K species and subspecies (as of September 
12 th , 2013). BOLD was also used for sequence alignment and 
calculation of genetic divergences and Neighbour Joining (NJ) 
trees using a K2P distance model [33] after alignment. These trees 
were imported in iTOL [34] to exploit its capacity for the 
visualization of large trees. In two cases where DNA barcode 
analysis suggested potential cases of overlooked diversity, we also 
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generated "geo-phylogenies" using GenGIS 2.2.0 [35] to better 
visualize the geographical structure of genetic variation. MEGA 
5.2.2 [36] was also used to produce character-based trees using the 
maximum parsimony (MP) criterion (all settings at default, with 
the complete deletion option for missing data) in cases of species- 
complexes where genetic distances are shallow and require a more 
rigorous method for reconstructing relationships between termi- 
nals. The topology of these DNA barcode "gene trees" was 
compared with the "species tree" inferred from current taxonomy 
based on morphology. Attention was directed toward an 
examination of the incidence of reciprocal monophyly rather 
than relationships between species. The division of a species into 
two or more clusters of specimens was considered a potential 
indicator of cryptic diversity (candidate species) when the species 
included two or more monophyletic clusters with more than 2% 
divergence [37,38] in their DNA barcode sequences. This 
threshold was used as an operational criterion to trigger in-depth 
study (comparative morphology, sequencing of a nuclear marker), 
but cases of shallower divergences are also reported and discussed 
when the genetic clusters match existing or previously proposed 
taxonomic divisions (at species or subspecies level). Candidate 
species were considered as Confirmed Candidate Species (CCS) 
when independent evidence (e.g. morphology, nuclear gene) also 
supported their distinction, or as Deep Conspecific Lineages 
(DCL) when no morphological differences or divergence at 28S 
rDNA were detected [39,40]. We note, however, that there is no 
objective way to confirm the status of any geographically isolated 
CCS as distinct species (or subspecies). As pointed out by Mutanen 
et al. [38], taxonomists working on Lepidoptera when dealing with 
closely related but allopatric populations have traditionally 
assigned them to different subspecies when slight morphological 
differences are apparent (usually in wing patterns), but to distinct 
species when morphological differences are greater (usually both in 
wing patterns and genitalia). The consideration of genetic 
differences as a basis for species delineation in Lepidoptera is 
recent. As a result there is no established practice regarding the 
taxonomic treatment of units currently diagnosed only through 
DNA data, whether their divisions are derived from the analyses of 
a single or multiple DNA markers. Our strategy has involved close 
collaboration with taxonomists studying Australian hawkmoths, 
highlighting all potential or confirmed candidate species for 
further morphological, ecological and biogeographical investiga- 



Data Accessibility (BOLD, GenBank) 

Data on specimen vouchers, sequence data including trace files 
and registered primer pairs are available on BOLD within the 
datasets SPH01AUS (dx.doi.org/10.5883/DATASET- 
SPH01AUS) for Australian records, and SPH02AUS 
(cLx.doi.org/10.5883/DS-SPH02AUS) for conspecifics and most 
closely related taxa outside Australia. All sequences were also 
deposited in GenBank for COI DNA barcodes and for 28S rDNA 
sequences (see Tables S1-S3). 

Results 

Diversity of Australian Sphingids 

In total, DNA barcodes were obtained from 1054 specimens 
representing 70 of the 72 valid sphingid species currendy 
recognized from Australia (Table 1). Belonging to 25 genera, 
these specimens were sampled from across the continent with the 
majority from Queensland (56%) and New South Wales (24%), 
where diversity is the highest and collecting efforts have been 
greatest (Fig. 1). This total includes four species (Psilogramma exigua, 
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Figure 1 . Geographical sampling. Distribution of the 1 054 specimens of Australian Sphingidae with DNA barcodes analysed in this study (yellow) 
and of the 735 additional specimens of conspecifics (including hetero-subspecific taxa) from outside Australia and relevant closely related species 
with DNA barcodes (see details of record lists in Table S1-S3). 
doi:10.1371/journal.pone.0101108.g001 



P. penumbra, Hopliocnema ochra, H. lacunosa) that were recently 
described on the basis of their DNA barcode divergence and 
correlated morphological differences [23,41,42]. The genus 
Psilogramma requires a global taxonomic and nomenclatural 
revision [41]. Nevertheless, but for the possible exceptions of P. 
menephron and P. papuensis, the nomenclature of the seven species 
found in Australia should remain stable if considering the three 
cases of synonymy revealed here by the analysis of the relevant 
type specimens (P. hausmanni syn. nov., P. gbriosa syn. nov., P. koalae 
syn. nov.; see Table 1, footnotes 4—6). 

NJ analysis of the barcode sequences indicated that represen- 
tatives of each of the 70 species formed a cohesive and reciprocally 
monophyletic cluster (Fig. 2; see Fig. SI for details). The mean 
within-genus divergence (7.13%, sd. = 0.007%) is considerably 
higher than the mean within-species distance (0.3%, 
sd. = 0.007%), matching the pattern of variation already reported 
in other families of Lepidoptera [43]. The lowest minimum 
interspecific distance was 1.19% between Macroglossum rectans and 
M. hirundo mans, but the clusters for these species were still 
separate, and both taxa possessed a maximum intraspecific 
divergence of 1.0%. The next closest pair, Hippotion celerio and H. 
velox, had 2.7% divergence, while Coequosa australasiae and C. 
triangularis had the greatest nearest-Neighbour divergence (7.2%). 
Intraspecific variation among the Australian samples (Fig. 2 and 
Table 1) was generally low, even in some of the most heavily 
sampled species such as Agrius convolvuli (Dmax = 0.5%, N = 1 1 3) or 
Theretra latreillii (Dmax =0.5%, N = 56). However, intraspecific 
divergence did exceed 2.0% in four species (Table 1): Imber tropicus 
(2%), Eupanacra spkndens (2.2%), Acosmeryx anceus (6.4%), and 
Synoecha marmorata (9.7%) with each taxon represented by two 
clusters (Fig. 2, Fig. SI). The first two cases involve intraspecific 
distances of approximately 2%, and we treat them as cases of DCL 
(Fig. 2, highlighted in pale yellow), a conclusion supported by their 
lack of morphological divergence and the absence in E. spkndens of 
congruent variation in 28S rRNA sequences (Fig. S2). In the other 
two cases, intraspecific divergence was substantially higher (Fig. 2, 
highlighted in red), and other lines of evidence support their status 
as CCS. Morphological investigation of the split in S. marmorata 
[44] revealed that one of its lineages represents an undescribed 
species (formally described as Coenotes Moulds & Melichar, [2014] 
during the review of this study). The original generic and specific 
identifications were erroneous, and the affinity of these specimens 
with the genus Coenotes is confirmed by morphology and is 



supported by the DNA barcode data (representatives of the new 
taxon show just 3.3% minimum genetic distance from C. 
eremophilae (Fig. SI)). The fourth species, A. anceus, includes two 
lineages with 6.4% divergence, one broadly distributed in eastern 
Australia, the other restricted to northern Queensland. Interest- 
ingly, this split matches a tentative separation already proposed in 
the 1970s by taxonomists Alan Hayes and Jean-Marie Cadiou (IJ. 
Kitching, personal communication) and materialized through 
specimens sorted accordingly in collections at the Natural History 
Museum, London. Re-examination of series of specimens and 
rearing revealed several morphological characters (differences in 
size and wing colouration, as well as larval and pupal morphology 
(J.P. Tutde & M.S. Moulds, personal communication)) and there is 
also one diagnostic nucleotide substitution between the taxa at 
position 550 in the analysed segment of 28S rRNA (Fig. S2). 
Furthermore, while the two DNA barcode clusters of A. anceus 
appear sympatric in northern Queensland (Fig. S3), they have 
micro-allopatric distributions; one cluster is restricted to scler- 
ophyll forests along the east coast of Australia, and the other has 
only been collected in rain forests in Eastern Cape York Peninsula 
and Papua New Guinea. Lasdy, a shallower split (1.4%) among 
Australian records of Hippotion brennus suggests the validity (in 
agreement with similar suggestions by Darge [45] and Riotte [46]) 
of the synonymized taxon, Johanna (Kirby, 1877), because the three 
sequenced specimens with a typical johanna-like morphology form a 
distinct DNA barcode group (Figs S 1 and S4) that is also separated 
geographically. Overall, considering this last and the two 
additional new species in the genera Acosmeryx and Coenotes, these 
results bring the total of Australian sphingid species to 75. 

A Global Perspective on Australian Sphingids 

Of the 75 species of sphingids that occur in Australia, 30 of 
them (40%) are endemic to the continent (Table S4). Another 45 
species then occur in both Australia and another geographic 
region, of which four are represented by an endemic subspecies in 
Australia (the nominal subspecies in two cases). This brings to 34 
(45%) the number of endemic sphingid taxa in Australia, at species 
or subspecies level. For all of the 45 species distributed in Australia 
and outside the continent, we examined the patterns of DNA 
barcode variation after including specimens from other regions. 
This involved the combined analysis of sequences from 1054 
Australian specimens, 599 conspecifics from outside Australia, and 
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Figure 2. DNA barcode variation in Australian Sphingidae. Neighbour Joining tree based on K2P distances for 1054 DNA barcodes of 70 
species of Australian sphingid moths. For each species the maximum intraspecific distance is given in the central part of the tree as histograms at the 
tip of branches (ranging from 0 to 9.7%, the circle line mark the 2% threshold); colour ranges highlight cases of synonymy (in blue), cryptic diversity 
(CCS, in red) as well as these species with 2% or more intraspecific distance (DCL, yellow) but not proven to represent two different species. Numbers 
refer to species numbers as listed in Table 1 . An interactive and fully explorable version of the tree is available at http://itol.embl.de/shared/rodroug. 
(See also Fig. S1J. 

doi:1 0.1 371 /journal.pone.01 01 1 08.g002 



136 records for closely related species from other regions (Table 1 
and Tables S1-S3). The 599 conspecific records include 112 con- 
subspecific records and 140 hetero-subspecific records for 18 
polytypic Australian species, and 347 records for 26 monotypic 
Australian species or species with no available record for distinct 
subspecies. The results of the analysis of this combined dataset are 
given in Fig. 3 (see Fig. S5 for details). 

This analysis revealed an additional very deep split in Eupanacra 
spkndms and ten more cases of deep (>2%) intraspecific 
divergences involving Agrius convohuli, Cephonodes picus, Daphnis 
placida, Gnathothlibus eras, Hippotion brennus, Macroglossum cotythus, M. 
tenebrosa, Nephele hespera, Psilogramma menephron and Theretra nessus 
(Table 1, Fig. 3). The two DNA barcode clusters in Agrius convohuli 
show deep divergence (Dmax = 5.37%), and clear geographical 
segregation (Fig. S6). Preliminary comparisons of wing patterns 
and male genitalia have been inconclusive and further morpho- 
logical study is required, but the two lineages show three 



diagnostic substitutions (positions 317, 486 and 505) in sequences 
for the 28S rDNA gene (Fig. S2), supporting their status as 
different species. Three other species with deep splits (E. spkndms 
(the West New Britain record only), T. nessus, H. brennus) showed 
morphological differences between representatives of the divergent 
DNA barcode clusters (e.g. Fig. S4), suggesting that they too 
represent cases of overlooked diversity. Other cases also merit 
deeper investigation. For example, the observed genetic diver- 
gences in C. picus challenge the proposed synonymy of C. 
cunninghami (Walker, 1856) [21] described from Australia. In this 
case, as well as three others (G. eras, D. placida, jV. hespera), the 
genetic splits are associated with geographical distances of several 
thousand kilometres across major biogeographical boundaries. In 
M. tenebrosa, DNA barcodes reveal the genetic distinctness of the 
representatives from Sulawesi (Fig. S5); morphological compari- 
sons are yet to be conducted, and no variation was observed in the 
few available sequences of the D2 fragment of the 28S rDNA gene 
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8G>2% 

I . Acosmeryx anceus 
7. Cephonodes picus 

I I . Daphnis placida 

13. Eupanacra splendens 

1 5. Gnalhothlibus eras 

1 6. Hippolion brennus 

23. Macroglossum corythus 
32. Macroglossum tenebrosa 
35. Nephele hespera 
43. Theretra nessits 
46. Theretra radiosa 

55. Imber tropicus 

56. Agrius convolvuli 

68. Psilogramma menephron 
7 1 . Synoecha marmorata 



Asp<2% 

1 5. Gnalhothlibus eras 

16. Hippolion brennus 
1 8. Hippothion rosetta 

44. Theretra oldenlandiae lewini 

45. Theretra queenslandi 
52. Ambulyx wildei 

59. Coenotes eremophilae 
68. Psilogramma menephron 



Figure 3. DNA barcode variation in the geographically extended dataset. Neighbour Joining tree based on K2P distances for the 1054 
Australian sphingid records analysed in Fig. 2 augmented by 735 records for conspecifics, co-subspecifics and closely related species from outside 
Australia. In the centre of the tree, the inner (red) and outer (green) histograms represent the maximum intraspecific distance (8G) and the distance to 
the nearest heterospecific Neighbour on BOLD (Asp). The species with 8G>2% and Asp<2% are highlighted in the tree with red and green colour 
ranges, respectively; those species are also listed with numbers following species numbers in Table 1. An interactive and fully explorable and 
searchable version of this tree can be accessed at http://itol.embl.de/shared/rodroug. (See also Fig. S5.). 
doi:10.1371/journal.pone.0101108.g003 
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(Fig. S2). Finally, both M. corythus and P. menephron represent species 
complexes requiring a comprehensive revision to establish the 
validity of the many available names; there are nine valid 
subspecies names [21] for the former, while we found that the 
sequenced holo types of 10 species within genus Psilogramma fall 
within our set of DNA barcode clusters forming the P. menephron 
complex (Fig. S5). 

In contrast to these cases of deep sequence divergence within 
species, the extended dataset revealed eight cases where Australian 
taxa showed less than 2% divergence (Table S5) from non- 
Australian specimens assigned to a different species (Fig. 3, Fig. S5, 
green boxes). Further analysis is required to evaluate the 
significance of these cases of similarity in DNA barcodes despite 
differing taxonomic assignments. Such cases can have several 
origins: identification errors, overlooked synonymy, incomplete 
lineage sorting or mitochondrial introgression through hybridiza- 
tion [47], or mitogenome replacement induced by Wolbachia 
infections [48]. Identification errors could not be ruled out to 
explain our results in the pair Hippotion rosetta/H. boerhaviae, two 
species long treated as one and often indistinguishable without 
genitalia dissection, and in the triplet Gnathothlibus eras/saccoi/ 
vanuatuensis, which are difficult to distinguish from wing morphol- 
ogy. Cases of overlooked synonymy account for many of the taxa 
in the Psilogramma menephron complex, and probably also the triplet 
Ambulyx wildei/ ceramensis/rudloffi (holotype sequenced) and possibly 
the pair Coenotes eremophilae/jakli (paratype sequenced). Finally, 
hybridization and associated introgression, or mitogenome 
replacement, may account for the barcode similarity between 
the morphologically very distinctive species pairs Hippotion brennus/ 
joiceyi (Fig. S4) and Theretra oldenlandiae / insignis (Fig. S7). In certain 
cases such as the triplet Theretra radio.sa/ ' queenslandi/ muricolour, 
barcode similarity may reflect the impact of two or more of these 
causal agents. 

These analyses also indicated that Australian subspecies are 
reciprocally monophyletic for 16 of the 18 species examined 
(Table S5 and Fig. S5). In ten of these cases, the Australian 
subspecies diverges by more than 2% from its closest conspecific 
subspecies with the most distant pair being Eupanacra splendens 
splendens / paradoxa with a minimum pairwise distance of 5.96%. We 
found only one pair and one triplet of subspecies that share highly 
similar DNA barcodes: (1) Theretra oldenlandiae oldenlandiae/ lewini, for 
which we found no shared haplotype, despite dense sampling (82 
samples of lewini and 6 of oldenlandiae); and (2) Theretra indistincta 
indistincta /manuselensis/papuensis, a triplet possibly representing a 
case of synonymy. 

Discussion 

Identification and discrimination of Australian sphingid 
moths 

The barcode reference library developed in this study provides 
coverage for 70 of the 72 currently valid Australian sphingid 
species and each of these species possesses a diagnostic array of 
DNA barcodes on the continent. Our results revealed seven 
species overlooked by past taxonomic work and five of these have 
recently been described, raising the species count for Psilogramma 
from five to seven, Hopliocnema from one to three, and Coenotes from 
one to two [23,41,42,44]. One species of Acosmeryx and Hippotion 
johanna need revalidation. These changes raise the species count for 
Australian sphingids to 75, a 17% increase from the last checklist 
for the family [22]. Our work also led to the unexpected discovery 
that the Convolvulus Hawkmoth, Agrius convolvuli, an emerging 
model species [49] is almost certainly two species. Its division into 
two species is relevant to all studies on this species (e.g. the 



identification of its sex pheromones [50]) and may be of practical 
importance as it is a pest on sweet potatoes, a major crop in Papua 
New Guinea [51]. 

Extent of the Linnean shortfall 

The Linnean shortfall [6] refers to the fact that most species on 
Earth are yet to be formally described, and implies that our 
current census of species (and other meaningful evolutionary units 
used by taxonomists, such as subspecies) is incomplete. In the case 
of Australian sphingids, our acquisition of DNA barcodes revealed 
seven overlooked species. While this result was perhaps unexpect- 
ed for a fauna considered as taxonomically mature, its significance 
is twofold. On one hand, it must be noted that all of the newly 
revealed species are highly similar morphologically to previously 
known taxa, and thus the integration of DNA barcodes played a 
key role in circumscribing these species and revealing their 
distinctness. On the other hand, these discoveries also reflect 
insufficient sampling effort as most of the new species originate 
from areas in the outback, where limited collecting efforts have 
occurred. In addition, our results question the taxonomic status of 
1 3 other species whose Australian lineages show marked sequence 
divergence from their counterparts outside Australia. Finally, eight 
other Australian species show close barcode congruence with 
lineages in other settings that are assigned to different species 
(Fig. 3). In total, 18 species (three show both splitting and lumping) 
- almost a quarter of the continental fauna - possess a taxonomic 
assignment that requires further study. The resolution of each case 
will require a thorough and critical revision of species diagnoses 
using an integrative approach, and will demand consideration of 
names currently listed as junior synonyms. Tackling the nomen- 
clatural part of this work is a particular challenge because 
sphingids have attracted so much attention from taxonomists 
resulting, for example, in 14 synonyms for A. convolvuli [21]. 
Fortunately, recent progress in the recovery of whole or partial 
DNA barcodes from old type specimens will aid resolution of these 
nomenclatural conundrums [32,52]. 

Extent of the Wallacean shortfall 

Knowledge of species distributions is fundamental to the design 
of sound conservation strategies and gaps in such information have 
been termed the "Wallacean shortfall" by biogeographers and 
conservation biologists [6]. Besides the scarcity of distributional 
data for most species, incorrect or incomplete delineation of 
taxonomic units will, in most cases, lead to incorrect information 
on species distributions, impeding efforts to understand the 
historical and contemporaneous processes underlying the occur- 
rence patterns of species or subspecies [6,53]. This study has 
revealed a significant mismatch between past understanding of the 
taxonomic diversity of Australian sphingids and that revealed 
through molecular analysis, especially when results are placed in a 
broader biogeographical context. The recent description of two 
Australian species each of Psilogramma and Hopliocnema [23,41,42], 
of one new species of Coenotes [44], and the separation by Vaglia & 
Haxaire [54] of the Papuan A. miskinoides from its Australian 
counterpart, A. miskini, were all motivated by DNA barcode data 
coupled subsequently with morphological analysis. These updates 
raise the number of endemic Australian species and subspecies 
from 26 to 34 (Fig. 4, Table S4). Further analysis of A. hespera may 
add another endemic species, while the possible synonymy of 
subspecies of T. indistincta may change its status from one polytypic 
species with an endemic Australian subspecies to a monotypic 
species distributed over Australia and the Malesian region. 

Our results additionally revealed that seven species with broad 
ranges (namely Cephonodes picas, Macroglossum corythus, Theretra nessus, 
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7. Cephonodes picus - distinct from SEA C. picus 

23. Macmglossum corythus - distinct from SEA & Pi M. corythus 

- 39. Theretra indistincta - lumped with EWL & WWL subspecies 
43. Theretra nessus - distinct from SEA & WWL T. nessus 

— 11. Daphnis placida - distinct from Pi, WWL & SEA D. placida 
15. Gnalhothlibus eras - distinct from Pi G. eras 



56. Agrius convolvuli - distinct from Pi, WWL, SEA & OW A. convolvuli 



68. Psilogramma menephron - distinct from WWL & SEA P. menephron 



■ 1. Acasmervx near anceus - overlooked Australian endemic 



2. Acosmetyx miskini - new endemic after description of EWL A. miskinoides 



- 35. Nephele hespera - distinct from SEA N. hespera 



" 59. Coenotes arida - newly described species 

61. Hopliocnema lacunosa - newly described species 

62. Hopliocnema ochra - newly described species 

66. Psilogramma exigua - newly described species 

70. Psilogramma penumbra - newly described species 

Figure 4. Distribution of Australian sphingid species. All 75 recognized species are linked by one or more coloured ribbons to the boundaries 
of their distributions (abbreviations as in Table 1). Species are ordered by distribution type, from the Australian endemics at the bottom left toward 
taxa with broader ranges at the top. Stars mark all species whose distributions were altered by our work, while text provides details about the cause 
of their distributional shift. The figure was assembled with the online application of Circos [57]. 
doi:10.1371/journal.pone.0101108.g004 



Daphnis placida, Nephele hespera, Agrius convolvuli, and Psilogramma 
menephron; see Fig. 4, and Fig. S5) actually comprise two (or more) 
genetically differentiated lineages likely to represent distinct 
species. In fact, in all of these cases, the genetic split is associated 
to a geographical disjunction. Although our current sampling of 
specimens with a DNA barcode remains too sparse in the region to 
precisely delimit the ranges of these newly circumscribed units, it is 
worth noting that in all cases the genetic lineage including the 
Australian representatives never extend westward beyond the 
Wallacean region. This result fits previous observations of the 
region as representing a major faunal discontinuity for Malesian 
sphingids, although it remains unclear where precisely the 
disjunctions occur and whether they match the modified course 
of Wallace's Line proposed by Beck et al. [55] for sphingids. 
Overall, our results indicate that the Australian sphingid fauna 
comprises two large biogeographical subsets (Fig. 4), one endemic 
to the continent (30 species and 4 subspecies, 45% of the fauna) 
and one including taxa also occurring on Pacific Islands and/or in 
the Papuan and Wallacean region (34 species/subspecies, 45%). A 
third small subset includes just seven Australian species (10%) 
whose distributions extend westward beyond the W allacean region 



(Fig. 4). The situation of those taxa found in Australia and on 
Pacific Islands deserves further attention. It is, for instance, 
interesting that the two species found in Australia and in the 
Pacific, but not in Papua New Guinea, occur in New Caledonia 
(N. subvaria) or New Caledonia and Fiji (Hippotion scrofa) but not 
in the Solomon Islands. In contrast, the three Australian species 
occurring on the Solomon Islands (D. moorei, T. nessus and E. 
splendens) are all also present on New Guinea. Overall, among the 
1 1 Australian species or subspecies that also occur on Pacific 
islands (Table 1; Fig. 4), eight were sampled from these islands and 
six revealed genetic divergences ranging from 1 to 2% (e.g. 1% for 
D. moorei in Solomon Islands, 1.5% for H. scrofa in New Caledonia 
and Fiji, 1.4% for A. convolvuli in New Caledonia), while two (H. 
velox and T. silhetensis intersecta) were genetically undifferentiated. 
These relatively low divergences likely reflect the recent history of 
colonization of these islands from Australia or Papua New Guinea 
during the Pleistocene, when periods of lowered sea level 
facilitated dispersal across sea gaps. Climatic events or human- 
mediated colonization might also have caused contemporaneous 
range expansion in Pacific islands; H. scrofa was for instance 
unrecorded from Fiji before 1975 despite sustained collecting 
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efforts before then (G. Robinson, personal communication), and 
the genetic identity of the specimen sampled here with one from 
New Caledonia (Fig. S5) supports this hypothesis of a recent 
colonization event. 

The taxonomic implications of these genetic divergences 
between allopatric populations require further examination, but 
these island "populations" certainly represent important evolu- 
tionary units, yet another layer of the Wallacean shortfall 
concealed by their current taxonomic treatment. We did examine 
some subspecies already described from Pacific islands and they 
show comparable levels of genetic divergence (e.g. 1.9% between 
D. p. placida and D. p. salomonis from the Solomon Islands) or even 
less (e.g. only 0.6% between T. n. nessus and T. n. albata from New 
Caledonia and Vanuatu). Judging from genetic evidence alone, the 
island lineages of some Australian species may deserve at least 
subspecific status, and confirmation by morphology should be 
sought. 

Conclusions 

The Linnean and Wallacean shortfalls have been stressed as two 
critical gaps in our knowledge of global biodiversity. Overlooked 
cryptic diversity and inadequate delineation of species boundaries 
inevitably create erroneous distribution patterns impeding the 
development of conservation strategies that focus on the right 
objects and the right places. In groups that have received little 
taxonomic attention, such as tropical and small-bodied inverte- 
brates, these shortfalls have been recognized as substantial [9,56]. 
However, this study reveals that a taxonomically mature insect 
fauna can also be strongly impacted by these shortfalls. Here, we 
have illustrated a solution involving the integration of DNA 
barcodes with an established taxonomic system based largely on 
morphology. This approach provides a robust means to identify 
and correct possible discrepancies in our current understanding of 
species and their distributions. 
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